Category suggestion for e-commerce queries
ثبت نشده
چکیده
This thesis proposes a category suggestion model for an e-commerce search engine to get an insight on the intent of the visitors. The dataset is provided by the Dutch e-commerce website Bol.com and consists of visitors queries and their subsequent clicks on items. The first part of the model is clustering the dataset to decrease the complexity and find previously unseen similarities between queries. Each query is weighted to all the other queries with a combination of the features lexical matching, word2vec, TF-IDF, co-clicks and a category tree match. The maximum weighted query pair clusters together and the model keeps clustering until it reaches a stop condition. The second part is creating a category distribution for every cluster from the clicks on the items associated with specific categories. The model is then tested by matching a query from the test set to all the clusters with a combination of the two features lexical matching and word2vec. The query inherits the category distribution of the cluster with the highest weight, and it is accurately assigned when the category of the query matches the top category of the inherited distribution. The whole test set is evaluated and results in the accuracy of the model. Various models are created with different combinations of the features and stop conditions. Each is evaluated with the same test set on their accuracy when the test features are varied as well, and the attributes of the clusters of each model are discussed. As are the general observations of all the models. The results vary depending on the importance of accuracy or speed. The model with the highest accuracy uses only the feature word2vec, and the test features is a combination of 70% lexical matching and 30% word2vec, which results in 66% accuracy. However, word2vec is slow, and the fastest model is the same model but only with the test feature lexical matching. The top 1000 queries are thus given categories suggestions, and only one cannot be assigned a category. Observations are made of a few large clusters in the model and the large quantity of one-query clusters. The aspects of the features are also discussed, such as the sparsity of the co-clicks feature. Future work includes adding more features to minimize the number of one-query clusters and optimization of feature selection. In conclusion, the category suggestion model returns proficient category distributions for an unseen query and an insight into the visitors' intent.
منابع مشابه
Mickey Mouse is not a Phrase: Improving Relevance in E-Commerce with Multiword Expressions
We describe a method for detecting phrases in e-commerce queries. The key insight is that previous buyer purchasing behavior as well as the general distribution of phrases in item titles must be used to select phrases. Many multiword expression (mwe) phrases which might be useful in other situations are not suitable for buyer query phrases because relevant items, as measured by purchases, do no...
متن کاملDynamic 3D Visualization of Database-Defined Tree Structures on the WWW by Using Rewriting Systems
This paper presents a new Web-based technique for visualizing dynamically large hierarchies, tree or category structures in 3D worlds. The technique uses CGI-scripting, SQL queries and Applet-based production systems to visualize in an efficient and versatile manner general category structures that are stored in relational databases. First, appropriate SQL queries extract the hierarchical categ...
متن کاملIRIT at CLEF 2015: A Product Search Model for Head Queries
We describe in this paper our participation in the product search task of LL4IR CLEF 2015 Lab. This task aims to evaluate, with living labs protective point of view, the retrieval effectiveness over e-commerce search engines. During the online shopping process, users would search for interesting products and quickly access those that fit with their needs among a long tail of similar or closely ...
متن کاملThe Impact of Query Suggestion in E-Commerce Websites
In this paper we propose a research agenda for studying the impact of query suggestion features on cognitive load and customer satisfaction during online shopping in e-commerce websites. Despite the popular use of query suggestion features in search engines and large e-commerce websites such as Amazon.com and eBay, there is little research in this area. Based on a review on prior literature in ...
متن کاملA Modern Approach to Integrate Database Queries for Searching E-Commerce Product
E-commerce refers to the utilization of electronic data transmission for enhancing business processes and implementing business strategies. Explicit components of e-commerce include providing after-sales services, promoting services/products to services, processing payment, engaging in transaction processes, identifying customer’s needs, processing payment and creating services/products. In rec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018